Detection of variable length anomalous subsequences in data streams

نویسندگان

  • Amany Abou Safia
  • Zaher Al Aghbari
چکیده

We consider the problem of anomaly detection in data streams, which is the problem of extracting subsequences that do not match an expected behaviour. The main challenge for detecting anomalous subsequences from data streams in the existing techniques is to determine the lengths of the normal and anomalous subsequences. Therefore, creating a robust model for detecting the anomalous subsequences is of critical importance. In this paper, we propose an incremental algorithm based on the dynamic time warping technique to detect anomalous subsequences in data streams. The proposed algorithm is able to detect anomalous subsequences under relaxed length constrains of the normal and/or the anomalous subsequences. That is the proposed algorithm is able to detect variable length anomalous subsequences from among variable length normal sequences. The proposed robust model can be applied in areas such as system health monitoring, event detection in sensor networks, and detecting eco-system disturbances, etc. The cost of the proposed algorithm is linear with time and memory.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Incremental Algorithm for Discovering Frequent Subsequences in Multiple Data Streams

In recent years, new applications emerged that produce data streams, such as stock data and sensor networks. Therefore, finding frequent subsequences, or clusters of subsequences, in data streams is an essential task in data mining. Data streams are continuous in nature, unbounded in size and have a high arrival rate. Due to these characteristics, traditional clustering algorithms fail to effec...

متن کامل

Real-time Bayesian Anomaly Detection for Environmental Sensor Data

Recent advances in sensor technology are facilitating the deployment of sensors into the environment that can produce measurements at high spatial and/or temporal resolutions. Not only can these data be used to better characterize systems for improved modeling, but they can also be used to produce better understandings of the mechanisms of environmental processes. One such use of these data is ...

متن کامل

Effective Outlier Detection using K-Nearest Neighbor Data Distributions: Unsupervised Exploratory Mining of Non-Stationarity in Data Streams

We describe approaches and preliminary experiments that are aimed at monitoring and detecting change in self-monitored data streams. We introduce a new algorithm for outlier detection using K-Nearest Neighbor Data Distributions. We run experiments on a variety of data stream topologies and thereby demonstrate the effectiveness of the new algorithm in detecting outliers and in quantitatively est...

متن کامل

A novel sequence representation for unsupervised analysis of human activities

We present a novel activity representation as bags of event n-grams to extract global structural information of activities using their local event statistics. Exploiting this representation, we present a computational framework for unsupervised activityclass discovery, activity classification and anomalous activity detection. To this end, we model activity-classes as maximally similar activity-...

متن کامل

Entropy Based Adaptive Outlier Detection Technique for Data Streams

Outlier detection in data streams is an immensely enthralling problem in many application areas such as network intrusion detection, faulty sensor detection, fraud detection in online financial transactions etc. Majority of existing outlier detection techniques have been mainly designed for static datasets and require a global view and multiple scans of data which is not feasible in case of str...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJIIDS

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2012